NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

PivotAlign: Improve Semi-Supervised Learning by Learning Intra-Class Heterogeneity and Aligning with Pivots

Yi, Lingjie; Sun, Tao; Zhang, Yikai; Zheng, Songzhu; Lyu, Weimin Lyu; Ling, Haibin Ling; Chen, Chao (February 2025, IEEE/CVF Winter Conference on Applications of Computer Vision (WACV))

Free, publicly-accessible full text available February 28, 2026
Learning to Segment from Noisy Annotations: A Spatial Correction Approach

Yao, Jiachen; Zhang, Yikai; Zheng, Songzhu; Goswami, Mayank; Prasanna, Prateek; Chen, Chao (May 2023, International Conference on Learning Representations)

Noisy labels can significantly affect the performance of deep neural networks (DNNs). In medical image segmentation tasks, annotations are error-prone due to the high demand in annotation time and in the annotators' expertise. Existing methods mostly tackle label noise in classification tasks. Their independent-noise assumptions do not fit label noise in segmentation task. In this paper, we propose a novel noise model for segmentation problems that encodes spatial correlation and bias, which are prominent in segmentation annotations. Further, to mitigate such label noise, we propose a label correction method to recover true label progressively. We provide theoretical guarantees of the correctness of the proposed method. Experiments show that our approach outperforms current state-of-the-art methods on both synthetic and real-world noisy annotations.
more » « less
Full Text Available
Learning to Segment from Noisy Annotations: A Spatial Correction Approach

Yao, Jiachen; Zhang, Yikai; Zheng, Songzhu; Goswami, Mayank; Prasanna, Prasanna; Chen, Chao (May 2023, International Conference on Learning Representations)

Noisy labels can significantly affect the performance of deep neural networks (DNNs). In medical image segmentation tasks, annotations are error-prone due to the high demand in annotation time and in the annotators' expertise. Existing methods mostly tackle label noise in classification tasks. Their independent-noise assumptions do not fit label noise in segmentation task. In this paper, we propose a novel noise model for segmentation problems that encodes spatial correlation and bias, which are prominent in segmentation annotations. Further, to mitigate such label noise, we propose a label correction method to recover true label progressively. We provide theoretical guarantees of the correctness of the proposed method. Experiments show that our approach outperforms current state-of-the-art methods on both synthetic and real-world noisy annotations.
more » « less
Full Text Available
A Multimodal Transformer: Fusing Clinical Notes with Structured EHR Data for Interpretable In-Hospital Mortality Prediction

Lyu, Weimin; Dong, Xinyu; Wong, Rachel; Zheng, Songzhu; Abell-Hart, Kayley; Wang, Fusheng; Chen, Chao (November 2022, AMIA Annual Symposium proceedings)

Deep-learning-based clinical decision support using structured electronic health records (EHR) has been an active research area for predicting risks of mortality and diseases. Meanwhile, large amounts of narrative clinical notes provide complementary information, but are often not integrated into predictive models. In this paper, we provide a novel multimodal transformer to fuse clinical notes and structured EHR data for better prediction of in-hospital mortality. To improve interpretability, we propose an integrated gradients (IG) method to select important words in clinical notes and discover the critical structured EHR features with Shapley values. These important words and clinical features are visualized to assist with interpretation of the prediction outcomes. We also investigate the significance of domain adaptive pretraining and task adaptive fine-tuning on the Clinical BERT, which is used to learn the representations of clinical notes. Experiments demonstrated that our model outperforms other methods (AUCPR: 0.538, AUCROC: 0.877, F1:0.490).
more » « less
Full Text Available
Topological Detection of Trojaned Neural Networks

Zheng Songzhu; Zhang Yikai; Wagner Hubert; Goswami Mayank; Chen Chao (December 2021, Advances in neural information processing systems)

Deep neural networks are known to have security issues. One particular threat is the Trojan attack. It occurs when the attackers stealthily manipulate the model's behavior through Trojaned training samples, which can later be exploited. Guided by basic neuroscientific principles we discover subtle -- yet critical -- structural deviation characterizing Trojaned models. In our analysis we use topological tools. They allow us to model high-order dependencies in the networks, robustly compare different networks, and localize structural abnormalities. One interesting observation is that Trojaned models develop short-cuts from input to output layers. Inspired by these observations, we devise a strategy for robust detection of Trojaned models. Compared to standard baselines it displays better performance on multiple benchmarks.
more » « less
Full Text Available
Learning with Feature-Dependent Label Noise: A Progressive Approach

Zhang, Yikai; Zheng, Songzhu; Wu, Pengxiang; Goswami, Mayank; Chen, Chao . (May 2021, Ninth International Conference on Learning Representations)
null (Ed.)
Label noise is frequently observed in real-world large-scale datasets. The noise is introduced due to a variety of reasons; it is heterogeneous and feature-dependent. Most existing approaches to handling noisy labels fall into two categories: they either assume an ideal feature-independent noise, or remain heuristic without theoretical guarantees. In this paper, we propose to target a new family of feature-dependent label noise, which is much more general than commonly used i.i.d. label noise and encompasses a broad spectrum of noise patterns. Focusing on this general noise family, we propose a progressive label correction algorithm that iteratively corrects labels and refines the model. We provide theoretical guarantees showing that for a wide variety of (unknown) noise patterns, a classifier trained with this strategy converges to be consistent with the Bayes classifier. In experiments, our method outperforms SOTA baselines and is robust to various noise types and levels.
more » « less
Full Text Available
Learning with Feature-Dependent Label Noise: A Progressive Approach

Zhang, Yikai; Zheng, Songzhu; Wu, Pengxiang; Goswami, Mayank; Chen, Chao (May 2021, International Conference on Learning Representations)
null (Ed.)
Label noise is frequently observed in real-world large-scale datasets. The noise is introduced due to a variety of reasons; it is heterogeneous and feature-dependent. Most existing approaches to handling noisy labels fall into two categories: they either assume an ideal feature-independent noise, or remain heuristic without theoretical guarantees. In this paper, we propose to target a new family of feature-dependent label noise, which is much more general than commonly used i.i.d. label noise and encompasses a broad spectrum of noise patterns. Focusing on this general noise family, we propose a progressive label correction algorithm that iteratively corrects labels and refines the model. We provide theoretical guarantees showing that for a wide variety of (unknown) noise patterns, a classifier trained with this strategy converges to be consistent with the Bayes classifier. In experiments, our method outperforms SOTA baselines and is robust to various noise types and levels.
more » « less
Full Text Available
Learning with Feature-Dependent Label Noise: A Progressive Approach

Zhang, Yikai; Zheng, Songzhu; Wu, Pengxiang; Goswami, Mayank; Chen, Chao (May 2021, International Conference on Learning Representations)
null (Ed.)
Label noise is frequently observed in real-world large-scale datasets. The noise is introduced due to a variety of reasons; it is heterogeneous and feature-dependent. Most existing approaches to handling noisy labels fall into two categories: they either assume an ideal feature-independent noise, or remain heuristic without theoretical guarantees. In this paper, we propose to target a new family of feature-dependent label noise, which is much more general than commonly used i.i.d. label noise and encompasses a broad spectrum of noise patterns. Focusing on this general noise family, we propose a progressive label correction algorithm that iteratively corrects labels and refines the model. We provide theoretical guarantees showing that for a wide variety of (unknown) noise patterns, a classifier trained with this strategy converges to be consistent with the Bayes classifier. In experiments, our method outperforms SOTA baselines and is robust to various noise types and levels.
more » « less
Full Text Available
A Topological Filter for Learning with Label Noise

Wu, Pengxiang; Zheng, Songzhu; Goswami, Mayank; Metaxas, Dimitris; Chen, Chao (December 2020, The Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS))

Full Text Available
Error-Bounded Correction of Noisy Labels

Zheng, Songzhu; Wu, Pengxiang; Goswami, Aman; Goswami, Mayank; Metaxas, Dimitris N.; Chen, Chao (December 2020, Proceedings of Machine Learning Research)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records